Dataset statistics
| Number of variables | 11 |
|---|---|
| Number of observations | 24576 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 2.1 MiB |
| Average record size in memory | 88.0 B |
Variable types
| Numeric | 10 |
|---|---|
| Categorical | 1 |
ROLE_TITLE is highly correlated with ROLE_CODE | High correlation |
ROLE_FAMILY_DESC is highly correlated with ROLE_TITLE and 1 other fields | High correlation |
ROLE_CODE is highly correlated with ROLE_TITLE | High correlation |
ROLE_ROLLUP_1 is highly correlated with ROLE_ROLLUP_2 | High correlation |
ROLE_ROLLUP_2 is highly correlated with ROLE_ROLLUP_1 | High correlation |
ID is uniformly distributed | Uniform |
ID has unique values | Unique |
Reproduction
| Analysis started | 2022-10-15 03:03:44.232255 |
|---|---|
| Analysis finished | 2022-10-15 03:04:05.952520 |
| Duration | 21.72 seconds |
| Software version | pandas-profiling v3.3.0 |
| Download configuration | config.json |
| Distinct | 24576 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 16367.64929 |
| Minimum | 0 |
|---|---|
| Maximum | 32768 |
| Zeros | 1 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 192.1 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1614.75 |
| Q1 | 8149.75 |
| median | 16403.5 |
| Q3 | 24524.25 |
| 95-th percentile | 31135.25 |
| Maximum | 32768 |
| Range | 32768 |
| Interquartile range (IQR) | 16374.5 |
Descriptive statistics
| Standard deviation | 9464.173852 |
|---|---|
| Coefficient of variation (CV) | 0.5782243793 |
| Kurtosis | -1.19855179 |
| Mean | 16367.64929 |
| Median Absolute Deviation (MAD) | 8181.5 |
| Skewness | -0.001474675394 |
| Sum | 402251349 |
| Variance | 89570586.71 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 2270 | 1 | < 0.1% |
| 696 | 1 | < 0.1% |
| 20871 | 1 | < 0.1% |
| 15706 | 1 | < 0.1% |
| 14645 | 1 | < 0.1% |
| 13668 | 1 | < 0.1% |
| 25403 | 1 | < 0.1% |
| 11357 | 1 | < 0.1% |
| 4435 | 1 | < 0.1% |
| 19028 | 1 | < 0.1% |
| Other values (24566) | 24566 |
| Value | Count | Frequency (%) |
| 0 | 1 | |
| 1 | 1 | |
| 2 | 1 | |
| 3 | 1 | |
| 5 | 1 | |
| 9 | 1 | |
| 10 | 1 | |
| 11 | 1 | |
| 12 | 1 | |
| 13 | 1 |
| Value | Count | Frequency (%) |
| 32768 | 1 | |
| 32766 | 1 | |
| 32765 | 1 | |
| 32764 | 1 | |
| 32763 | 1 | |
| 32762 | 1 | |
| 32761 | 1 | |
| 32760 | 1 | |
| 32759 | 1 | |
| 32757 | 1 |
RESOURCE
Real number (ℝ≥0)
| Distinct | 6469 |
|---|---|
| Distinct (%) | 26.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 42881.13037 |
| Minimum | 0 |
|---|---|
| Maximum | 312153 |
| Zeros | 11 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 192.1 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 3853 |
| Q1 | 20299 |
| median | 35210 |
| Q3 | 74189.25 |
| 95-th percentile | 81355 |
| Maximum | 312153 |
| Range | 312153 |
| Interquartile range (IQR) | 53890.25 |
Descriptive statistics
| Standard deviation | 34262.36267 |
|---|---|
| Coefficient of variation (CV) | 0.7990079173 |
| Kurtosis | 16.82170152 |
| Mean | 42881.13037 |
| Median Absolute Deviation (MAD) | 16792 |
| Skewness | 2.820603172 |
| Sum | 1053846660 |
| Variance | 1173909496 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 4675 | 638 | 2.6% |
| 79092 | 347 | 1.4% |
| 75078 | 321 | 1.3% |
| 25993 | 317 | 1.3% |
| 3853 | 295 | 1.2% |
| 75834 | 226 | 0.9% |
| 32270 | 224 | 0.9% |
| 6977 | 223 | 0.9% |
| 42085 | 188 | 0.8% |
| 1020 | 179 | 0.7% |
| Other values (6459) | 21618 |
| Value | Count | Frequency (%) |
| 0 | 11 | |
| 38 | 6 | |
| 136 | 1 | < 0.1% |
| 138 | 2 | < 0.1% |
| 153 | 8 | |
| 203 | 3 | < 0.1% |
| 216 | 3 | < 0.1% |
| 233 | 1 | < 0.1% |
| 237 | 2 | < 0.1% |
| 256 | 4 | < 0.1% |
| Value | Count | Frequency (%) |
| 312153 | 1 | < 0.1% |
| 312152 | 1 | < 0.1% |
| 312140 | 1 | < 0.1% |
| 312139 | 1 | < 0.1% |
| 312132 | 1 | < 0.1% |
| 312131 | 1 | < 0.1% |
| 312130 | 4 | |
| 312129 | 3 | |
| 312122 | 1 | < 0.1% |
| 312121 | 1 | < 0.1% |
MGR_ID
Real number (ℝ≥0)
| Distinct | 3996 |
|---|---|
| Distinct (%) | 16.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 25893.69328 |
| Minimum | 25 |
|---|---|
| Maximum | 311696 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 192.1 KiB |
Quantile statistics
| Minimum | 25 |
|---|---|
| 5-th percentile | 1140 |
| Q1 | 4564 |
| median | 13441 |
| Q3 | 41786 |
| 95-th percentile | 87834.75 |
| Maximum | 311696 |
| Range | 311671 |
| Interquartile range (IQR) | 37222 |
Descriptive statistics
| Standard deviation | 35746.79671 |
|---|---|
| Coefficient of variation (CV) | 1.380521362 |
| Kurtosis | 17.70545767 |
| Mean | 25893.69328 |
| Median Absolute Deviation (MAD) | 9774 |
| Skewness | 3.353856286 |
| Sum | 636363406 |
| Variance | 1277833475 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 770 | 111 | 0.5% |
| 2270 | 66 | 0.3% |
| 2594 | 63 | 0.3% |
| 1350 | 56 | 0.2% |
| 2014 | 54 | 0.2% |
| 16850 | 52 | 0.2% |
| 5396 | 50 | 0.2% |
| 7807 | 47 | 0.2% |
| 18213 | 46 | 0.2% |
| 18686 | 46 | 0.2% |
| Other values (3986) | 23985 |
| Value | Count | Frequency (%) |
| 25 | 18 | |
| 27 | 13 | |
| 30 | 5 | < 0.1% |
| 32 | 4 | < 0.1% |
| 33 | 21 | |
| 36 | 9 | |
| 43 | 2 | < 0.1% |
| 46 | 2 | < 0.1% |
| 47 | 8 | < 0.1% |
| 55 | 4 | < 0.1% |
| Value | Count | Frequency (%) |
| 311696 | 13 | |
| 311683 | 5 | < 0.1% |
| 311682 | 1 | < 0.1% |
| 311651 | 2 | < 0.1% |
| 311597 | 1 | < 0.1% |
| 311433 | 4 | < 0.1% |
| 311355 | 2 | < 0.1% |
| 311338 | 1 | < 0.1% |
| 311251 | 1 | < 0.1% |
| 311203 | 2 | < 0.1% |
| Distinct | 123 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 116955.3704 |
| Minimum | 4292 |
|---|---|
| Maximum | 311178 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 192.1 KiB |
Quantile statistics
| Minimum | 4292 |
|---|---|
| 5-th percentile | 117902 |
| Q1 | 117961 |
| median | 117961 |
| Q3 | 117961 |
| 95-th percentile | 119134 |
| Maximum | 311178 |
| Range | 306886 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 10950.86747 |
|---|---|
| Coefficient of variation (CV) | 0.09363287406 |
| Kurtosis | 91.76494067 |
| Mean | 116955.3704 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | -6.118413445 |
| Sum | 2874295184 |
| Variance | 119921498.4 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 117961 | 15994 | |
| 117902 | 562 | 2.3% |
| 91261 | 544 | 2.2% |
| 118315 | 379 | 1.5% |
| 118212 | 309 | 1.3% |
| 118290 | 299 | 1.2% |
| 119062 | 275 | 1.1% |
| 118887 | 252 | 1.0% |
| 118169 | 230 | 0.9% |
| 117916 | 225 | 0.9% |
| Other values (113) | 5507 | 22.4% |
| Value | Count | Frequency (%) |
| 4292 | 10 | < 0.1% |
| 5110 | 141 | 0.6% |
| 11146 | 22 | 0.1% |
| 91261 | 544 | |
| 117876 | 131 | 0.5% |
| 117882 | 14 | 0.1% |
| 117887 | 73 | 0.3% |
| 117890 | 177 | 0.7% |
| 117893 | 63 | 0.3% |
| 117902 | 562 |
| Value | Count | Frequency (%) |
| 311178 | 2 | < 0.1% |
| 247952 | 9 | < 0.1% |
| 216705 | 10 | |
| 209434 | 1 | < 0.1% |
| 203209 | 1 | < 0.1% |
| 192441 | 4 | < 0.1% |
| 183723 | 3 | < 0.1% |
| 147236 | 1 | < 0.1% |
| 138798 | 24 | |
| 132839 | 7 | < 0.1% |
| Distinct | 168 |
|---|---|
| Distinct (%) | 0.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 118260.8927 |
| Minimum | 23779 |
|---|---|
| Maximum | 286791 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 192.1 KiB |
Quantile statistics
| Minimum | 23779 |
|---|---|
| 5-th percentile | 117936 |
| Q1 | 118102 |
| median | 118300 |
| Q3 | 118386 |
| 95-th percentile | 119256 |
| Maximum | 286791 |
| Range | 263012 |
| Interquartile range (IQR) | 284 |
Descriptive statistics
| Standard deviation | 4841.345712 |
|---|---|
| Coefficient of variation (CV) | 0.04093784175 |
| Kurtosis | 354.6126288 |
| Mean | 118260.8927 |
| Median Absolute Deviation (MAD) | 86 |
| Skewness | -13.55252197 |
| Sum | 2906379700 |
| Variance | 23438628.3 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 118300 | 3342 | 13.6% |
| 118343 | 2993 | 12.2% |
| 118327 | 1960 | 8.0% |
| 118225 | 1895 | 7.7% |
| 118386 | 1321 | 5.4% |
| 118052 | 1238 | 5.0% |
| 117962 | 1162 | 4.7% |
| 118413 | 960 | 3.9% |
| 118446 | 722 | 2.9% |
| 118026 | 544 | 2.2% |
| Other values (158) | 8439 |
| Value | Count | Frequency (%) |
| 23779 | 23 | 0.1% |
| 31010 | 37 | 0.2% |
| 117877 | 131 | 0.5% |
| 117883 | 11 | < 0.1% |
| 117891 | 113 | 0.5% |
| 117894 | 63 | 0.3% |
| 117903 | 372 | |
| 117911 | 98 | 0.4% |
| 117917 | 61 | 0.2% |
| 117919 | 47 | 0.2% |
| Value | Count | Frequency (%) |
| 286791 | 1 | < 0.1% |
| 159716 | 8 | < 0.1% |
| 151110 | 7 | < 0.1% |
| 147237 | 1 | < 0.1% |
| 145248 | 6 | < 0.1% |
| 141176 | 4 | < 0.1% |
| 140550 | 1 | < 0.1% |
| 138799 | 24 | |
| 132840 | 1 | < 0.1% |
| 132564 | 1 | < 0.1% |
ROLE_DEPTNAME
Real number (ℝ≥0)
| Distinct | 440 |
|---|---|
| Distinct (%) | 1.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 118854.6591 |
| Minimum | 4674 |
|---|---|
| Maximum | 286792 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 192.1 KiB |
Quantile statistics
| Minimum | 4674 |
|---|---|
| 5-th percentile | 117878 |
| Q1 | 118391 |
| median | 118910 |
| Q3 | 120428 |
| 95-th percentile | 125016 |
| Maximum | 286792 |
| Range | 282118 |
| Interquartile range (IQR) | 2037 |
Descriptive statistics
| Standard deviation | 18639.57457 |
|---|---|
| Coefficient of variation (CV) | 0.156826621 |
| Kurtosis | 39.96228982 |
| Mean | 118854.6591 |
| Median Absolute Deviation (MAD) | 969 |
| Skewness | -0.375063125 |
| Sum | 2920972102 |
| Variance | 347433740.2 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 117878 | 856 | 3.5% |
| 117941 | 596 | 2.4% |
| 118514 | 467 | 1.9% |
| 117945 | 465 | 1.9% |
| 117920 | 454 | 1.8% |
| 117884 | 424 | 1.7% |
| 118403 | 410 | 1.7% |
| 119598 | 409 | 1.7% |
| 119181 | 400 | 1.6% |
| 120722 | 383 | 1.6% |
| Other values (430) | 19712 |
| Value | Count | Frequency (%) |
| 4674 | 32 | 0.1% |
| 5488 | 24 | 0.1% |
| 5606 | 7 | < 0.1% |
| 6104 | 43 | |
| 6725 | 79 | |
| 7646 | 6 | < 0.1% |
| 16232 | 62 | |
| 19666 | 64 | |
| 19772 | 96 | |
| 20807 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 286792 | 1 | < 0.1% |
| 277693 | 86 | |
| 275600 | 7 | < 0.1% |
| 274241 | 8 | < 0.1% |
| 272283 | 1 | < 0.1% |
| 253965 | 8 | < 0.1% |
| 240766 | 3 | < 0.1% |
| 225010 | 15 | 0.1% |
| 215920 | 3 | < 0.1% |
| 204054 | 3 | < 0.1% |
| Distinct | 331 |
|---|---|
| Distinct (%) | 1.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 125661.4926 |
| Minimum | 117879 |
|---|---|
| Maximum | 311867 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 192.1 KiB |
Quantile statistics
| Minimum | 117879 |
|---|---|
| 5-th percentile | 117885 |
| Q1 | 118274 |
| median | 118568 |
| Q3 | 120006 |
| 95-th percentile | 135809 |
| Maximum | 311867 |
| Range | 193988 |
| Interquartile range (IQR) | 1732 |
Descriptive statistics
| Standard deviation | 30491.34304 |
|---|---|
| Coefficient of variation (CV) | 0.2426466725 |
| Kurtosis | 24.99307548 |
| Mean | 125661.4926 |
| Median Absolute Deviation (MAD) | 604 |
| Skewness | 5.074521184 |
| Sum | 3088256842 |
| Variance | 929722000.1 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 118321 | 3456 | 14.1% |
| 117905 | 2701 | 11.0% |
| 118784 | 1368 | 5.6% |
| 117879 | 963 | 3.9% |
| 118568 | 800 | 3.3% |
| 117885 | 606 | 2.5% |
| 118054 | 567 | 2.3% |
| 118685 | 429 | 1.7% |
| 118777 | 419 | 1.7% |
| 118451 | 400 | 1.6% |
| Other values (321) | 12867 |
| Value | Count | Frequency (%) |
| 117879 | 963 | 3.9% |
| 117885 | 606 | 2.5% |
| 117896 | 126 | 0.5% |
| 117899 | 181 | 0.7% |
| 117905 | 2701 | |
| 117946 | 252 | 1.0% |
| 117985 | 13 | 0.1% |
| 118028 | 68 | 0.3% |
| 118043 | 185 | 0.8% |
| 118047 | 10 | < 0.1% |
| Value | Count | Frequency (%) |
| 311867 | 3 | < 0.1% |
| 310825 | 1 | < 0.1% |
| 307024 | 351 | |
| 299559 | 5 | < 0.1% |
| 297560 | 1 | < 0.1% |
| 280788 | 269 | |
| 279482 | 4 | < 0.1% |
| 273308 | 25 | 0.1% |
| 270690 | 1 | < 0.1% |
| 268608 | 1 | < 0.1% |
| Distinct | 2183 |
|---|---|
| Distinct (%) | 8.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 169860.2845 |
| Minimum | 4673 |
|---|---|
| Maximum | 311867 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 192.1 KiB |
Quantile statistics
| Minimum | 4673 |
|---|---|
| 5-th percentile | 117906 |
| Q1 | 117906 |
| median | 128628 |
| Q3 | 233714 |
| 95-th percentile | 306795 |
| Maximum | 311867 |
| Range | 307194 |
| Interquartile range (IQR) | 115808 |
Descriptive statistics
| Standard deviation | 69329.22149 |
|---|---|
| Coefficient of variation (CV) | 0.408154394 |
| Kurtosis | -0.6219345289 |
| Mean | 169860.2845 |
| Median Absolute Deviation (MAD) | 10722 |
| Skewness | 1.005623488 |
| Sum | 4174486352 |
| Variance | 4806540952 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 117906 | 5184 | 21.1% |
| 240983 | 937 | 3.8% |
| 117913 | 513 | 2.1% |
| 279443 | 473 | 1.9% |
| 117886 | 399 | 1.6% |
| 130134 | 315 | 1.3% |
| 117897 | 268 | 1.1% |
| 117879 | 255 | 1.0% |
| 168365 | 246 | 1.0% |
| 133686 | 242 | 1.0% |
| Other values (2173) | 15744 |
| Value | Count | Frequency (%) |
| 4673 | 13 | 0.1% |
| 62587 | 2 | < 0.1% |
| 117879 | 255 | 1.0% |
| 117886 | 399 | 1.6% |
| 117897 | 268 | 1.1% |
| 117899 | 49 | 0.2% |
| 117905 | 3 | < 0.1% |
| 117906 | 5184 | |
| 117913 | 513 | 2.1% |
| 117937 | 9 | < 0.1% |
| Value | Count | Frequency (%) |
| 311867 | 2 | < 0.1% |
| 311839 | 3 | < 0.1% |
| 311834 | 2 | < 0.1% |
| 311792 | 1 | < 0.1% |
| 311782 | 1 | < 0.1% |
| 311778 | 2 | < 0.1% |
| 311746 | 8 | < 0.1% |
| 311701 | 15 | 0.1% |
| 311635 | 9 | < 0.1% |
| 311622 | 147 |
ROLE_FAMILY
Real number (ℝ≥0)
| Distinct | 64 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 183598.0446 |
| Minimum | 3130 |
|---|---|
| Maximum | 308574 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 192.1 KiB |
Quantile statistics
| Minimum | 3130 |
|---|---|
| 5-th percentile | 19721 |
| Q1 | 118363 |
| median | 119095 |
| Q3 | 290919 |
| 95-th percentile | 292795 |
| Maximum | 308574 |
| Range | 305444 |
| Interquartile range (IQR) | 172556 |
Descriptive statistics
| Standard deviation | 100563.0915 |
|---|---|
| Coefficient of variation (CV) | 0.5477350903 |
| Kurtosis | -1.484153967 |
| Mean | 183598.0446 |
| Median Absolute Deviation (MAD) | 99374 |
| Skewness | -0.07835519969 |
| Sum | 4512105543 |
| Variance | 1.011293537 × 1010 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 290919 | 8278 | |
| 118424 | 2028 | 8.3% |
| 19721 | 2016 | 8.2% |
| 117887 | 1775 | 7.2% |
| 292795 | 968 | 3.9% |
| 118398 | 962 | 3.9% |
| 308574 | 942 | 3.8% |
| 118453 | 712 | 2.9% |
| 118331 | 660 | 2.7% |
| 118638 | 578 | 2.4% |
| Other values (54) | 5657 |
| Value | Count | Frequency (%) |
| 3130 | 109 | 0.4% |
| 4673 | 271 | 1.1% |
| 6725 | 69 | 0.3% |
| 19721 | 2016 | |
| 19793 | 279 | 1.1% |
| 117887 | 1775 | |
| 118131 | 125 | 0.5% |
| 118205 | 334 | 1.4% |
| 118295 | 363 | 1.5% |
| 118331 | 660 | 2.7% |
| Value | Count | Frequency (%) |
| 308574 | 942 | 3.8% |
| 292795 | 968 | 3.9% |
| 290919 | 8278 | |
| 270488 | 523 | 2.1% |
| 254395 | 2 | < 0.1% |
| 249618 | 162 | 0.7% |
| 161100 | 1 | < 0.1% |
| 155173 | 3 | < 0.1% |
| 151277 | 7 | < 0.1% |
| 149353 | 2 | < 0.1% |
| Distinct | 331 |
|---|---|
| Distinct (%) | 1.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 119765.3124 |
| Minimum | 117880 |
|---|---|
| Maximum | 270691 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 192.1 KiB |
Quantile statistics
| Minimum | 117880 |
|---|---|
| 5-th percentile | 117888 |
| Q1 | 118209 |
| median | 118570 |
| Q3 | 119353 |
| 95-th percentile | 125795 |
| Maximum | 270691 |
| Range | 152811 |
| Interquartile range (IQR) | 1144 |
Descriptive statistics
| Standard deviation | 5559.507074 |
|---|---|
| Coefficient of variation (CV) | 0.04642001063 |
| Kurtosis | 270.1023039 |
| Mean | 119765.3124 |
| Median Absolute Deviation (MAD) | 515 |
| Skewness | 13.5419255 |
| Sum | 2943352317 |
| Variance | 30908118.9 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 118322 | 3456 | 14.1% |
| 117908 | 2701 | 11.0% |
| 118786 | 1368 | 5.6% |
| 117880 | 963 | 3.9% |
| 118570 | 800 | 3.3% |
| 117888 | 606 | 2.5% |
| 118055 | 567 | 2.3% |
| 118687 | 429 | 1.7% |
| 118779 | 419 | 1.7% |
| 118454 | 400 | 1.6% |
| Other values (321) | 12867 |
| Value | Count | Frequency (%) |
| 117880 | 963 | 3.9% |
| 117888 | 606 | 2.5% |
| 117898 | 126 | 0.5% |
| 117900 | 181 | 0.7% |
| 117908 | 2701 | |
| 117948 | 252 | 1.0% |
| 117973 | 273 | 1.1% |
| 117987 | 13 | 0.1% |
| 118030 | 68 | 0.3% |
| 118046 | 185 | 0.8% |
| Value | Count | Frequency (%) |
| 270691 | 1 | < 0.1% |
| 268610 | 1 | < 0.1% |
| 266863 | 1 | < 0.1% |
| 258436 | 4 | |
| 254396 | 2 | < 0.1% |
| 247660 | 5 | |
| 239004 | 1 | < 0.1% |
| 216827 | 5 | |
| 212194 | 2 | < 0.1% |
| 208127 | 1 | < 0.1% |
ACTION
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 192.1 KiB |
| 1 | |
|---|---|
| 0 | 1428 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 24576 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 1 |
| 3rd row | 1 |
| 4th row | 1 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
| 1 | 23148 | |
| 0 | 1428 | 5.8% |
Length
Histogram of lengths of the category
Category Frequency Plot
| Value | Count | Frequency (%) |
| 1 | 23148 | |
| 0 | 1428 | 5.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 23148 | |
| 0 | 1428 | 5.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 24576 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 23148 | |
| 0 | 1428 | 5.8% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 24576 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 23148 | |
| 0 | 1428 | 5.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 24576 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 23148 | |
| 0 | 1428 | 5.8% |
Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here. A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
First rows
| ID | RESOURCE | MGR_ID | ROLE_ROLLUP_1 | ROLE_ROLLUP_2 | ROLE_DEPTNAME | ROLE_TITLE | ROLE_FAMILY_DESC | ROLE_FAMILY | ROLE_CODE | ACTION | |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 2270 | 75078 | 255037 | 118315 | 118316 | 118202 | 118784 | 262095 | 290919 | 118786 | 1 |
| 1 | 696 | 79323 | 3120 | 117961 | 118300 | 120312 | 120313 | 120314 | 118424 | 120315 | 1 |
| 2 | 13514 | 34958 | 8243 | 118555 | 118178 | 118320 | 117905 | 117906 | 290919 | 117908 | 1 |
| 3 | 13400 | 39371 | 7520 | 117961 | 118343 | 124725 | 117905 | 240983 | 290919 | 117908 | 1 |
| 4 | 6703 | 39330 | 17290 | 117961 | 118386 | 118522 | 117905 | 117906 | 290919 | 117908 | 1 |
| 5 | 24671 | 15818 | 2017 | 117961 | 118327 | 121645 | 124886 | 147144 | 118643 | 124888 | 1 |
| 6 | 1451 | 25638 | 1755 | 117961 | 117962 | 119223 | 125793 | 146749 | 118643 | 125795 | 1 |
| 7 | 4215 | 33235 | 16973 | 117961 | 118300 | 124942 | 117905 | 117906 | 290919 | 117908 | 1 |
| 8 | 11822 | 39939 | 4924 | 117961 | 118300 | 120144 | 118054 | 124356 | 117887 | 118055 | 1 |
| 9 | 11537 | 80765 | 25607 | 117961 | 118343 | 118856 | 118321 | 117906 | 290919 | 118322 | 1 |
Last rows
| ID | RESOURCE | MGR_ID | ROLE_ROLLUP_1 | ROLE_ROLLUP_2 | ROLE_DEPTNAME | ROLE_TITLE | ROLE_FAMILY_DESC | ROLE_FAMILY | ROLE_CODE | ACTION | |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 24566 | 16850 | 39262 | 5509 | 117961 | 118343 | 123454 | 118784 | 117906 | 290919 | 118786 | 1 |
| 24567 | 6265 | 35625 | 19717 | 117961 | 117962 | 118352 | 118321 | 117906 | 290919 | 118322 | 0 |
| 24568 | 22118 | 19751 | 7551 | 117961 | 118052 | 118867 | 118259 | 117906 | 290919 | 118261 | 1 |
| 24569 | 11284 | 20292 | 87977 | 117926 | 117927 | 117884 | 118568 | 281735 | 19721 | 118570 | 1 |
| 24570 | 11964 | 74226 | 51372 | 117961 | 118343 | 120666 | 118777 | 279443 | 308574 | 118779 | 1 |
| 24571 | 21575 | 971 | 4308 | 117961 | 118343 | 118833 | 118834 | 309123 | 118424 | 118836 | 1 |
| 24572 | 29802 | 1020 | 17386 | 117961 | 118446 | 119064 | 120690 | 130887 | 290919 | 120692 | 1 |
| 24573 | 5390 | 40474 | 32242 | 117961 | 118327 | 121979 | 117905 | 117906 | 290919 | 117908 | 1 |
| 24574 | 860 | 25553 | 66400 | 117910 | 117911 | 117920 | 123191 | 123191 | 19721 | 123192 | 1 |
| 24575 | 15795 | 34963 | 52925 | 117980 | 117981 | 117920 | 117879 | 117886 | 19721 | 117880 | 1 |